A system of mobile agents to model social networks 



O 
O 

D 



Marta C. Gonzalez,''^ Pedro G. Lind,''^ and Hans J. Herrmann''^ 

' Institute for Computational Physics, Universitdt Stuttgart, Pfajfenwaldring 27, D-70569 Stuttgart, Germany 
^ Departamento de Fi'sica, Universidade Federal do Ceard, 60451-970 Fortaleza, Brazil 
^Centra de Fi'sica Teorica e Computacional, Av. Prof. Gama Pinto 2, 1649-003 Lisbon, Portugal 

(Dated: February 2, 2008) 

We propose a model of mobile agents to construct social networks, based on a system of moving particles 
by keeping track of the collisions during their permanence in the system. We reproduce not only the degree 
distribution, clustering coefficient and shortest path length of a large data base of empirical friendship networks 
recently collected, but also some features related with their community structure. The model is completely 
characterized by the collision rate and above a critical collision rate we find the emergence of a giant cluster 
in the universality class of two-dimensional percolation. Moreover, we propose possible schemes to reproduce 
other networks of particular social contacts, namely sexual contacts. 
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Friendships among a group of people, actors working in the 
same movie or co-authors of the same paper, are all examples 
of systems represented as networks, whose study imprinted to 
social networks an unquestionable place in the field of com- 
plex networks lOI^- However, the topological features of net- 
works of acquaintances fundamentally differ from other net- 
worked systems I2,i3j. First, they are single-scale networks 
and present small-world effect 0]. Second, they are divided 
into groups or communities 01. Additionally, their evolution 
process differs from standard growth models as those that gov- 
ern e.g. the World Wide Web. An interesting development in 
this area is given in jsl where it is proposed a simple proce- 
dure of transitive linking to generate small-world networks. 
While each one of the mentioned features can be reproduced 
with some previous model, there is still no single model that 
incorporates simultaneously dynamical evolution, clustering 
and community structure. 

In this Letter we show that all these characteristics can be 
reproduced in a very natural way, by using standard concepts 
and techniques from physical systems. Namely, we propose 
an approach to dynamical networks based on a system of mo- 
bile agents representing the nodes of the network. We will 
show that, due to this motion, it is possible to reproduce the 
main properties QJ 0] of empirical social networks, namely 
the degree distribution, the clustering coefficient (CC) and the 
shortest path length, by choosing the same average degree 
measured in the empirical networks, and adjusting only one 
parameter, the density of the system. The community struc- 
ture emerges naturally, without labeling a priori the commu- 
nity each agent belongs to, as in previous works fl]. More- 
over, this approach gives some insight to further explain the 
structure of empirical networks, from a recently available 
large data set of friendship networks 0| concerning 90118 
students, divided among 84 schools from USA, constructed 
from an In-School questionnaire. The acquaintance between 
pairs of students was rigorously defined. Each student was 
given a paper-and-pencil questionnaire and a copy of a list 
with every student in the school. The student was asked to 
check if he/she participated in any of 5 activities with the 
friend: like going to (his/her) house in the last seven days, or 



meeting (him/her) after school to hang out or go somewhere in 
the last seven days, etc. Other studies |i4] have used a slightly 
different definition of friendships and obtained the same kind 
of degree distribution, an indication of the robustness of the 
concept of friendship. 

Our model comprehends particles (agents) with radius 
r moving continuously in a square shaped cell of linear size 
L with periodic boundary conditions and low density p = 
N/L"^. One link (acquaintance) is formed whenever two 
agents intercept. After each collision, each colliding agent 
moves in a random direction with an updated velocity, till it 
collides again acquiring a new random direction, and so forth. 
In this way, the resulting movement alternates between drift 
(between collisions) and diffusion (collisions). Similarly to 
human communities, agents arrive and depart after a certain 
time of residence, the total number of agents remaining fixed 
in time, which enables the system to reach a quasi-stationary 
state. Initially all agents are placed randomly, with the same 
velocity modulus uq and random directions. At each time step 
At, the position Xj of agent i is updated according to 



x,(i + l) = x,(t)+v,(t)At. 



(1) 



After collisions velocity modulus of each agent, say i, is up- 
dated proportionally to its degree fc^, defined as the number of 
links connected to an agent i at time t: 



(2) 



where u is a constant having unit of velocity and Vq is the 
initial velocity of the agents, corresponding to a characteristic 
time To = 1/ {2\/2TTrpVo) between collisions. We assume that 
'age' Ai is the only intrinsic property of each agent i, initially 
randomly and homogeneously chosen from an interval [0, Tj], 
and updated as 



A(t + i) = A(t) + Ai. 



(3) 



When Ai ~ Ti, agent i leaves the system, all its links are 
removed, and a new agent replaces its position with the initial 
conditions stated above, namely velocity modulus vq and an 
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age randomly distributed in the range [0, Ti\. Therefore the 
time of permanence of an agent in the system is given by T; — 

A(0). 

After a certain transient the system reaches a quasi- 
stationary (QS) state. Thus, the degree distribution, degree 
correlations and community structure depend only on two pa- 
rameters, namely p and Ti/to- Figure^ illustrates the con- 
vergence towards the QS state for the average degree k[t) per 
agent. 
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FIG. 1 : (Color online) (a) Average degree k per agent as function of 
time t, illustrating the convergence towards a QS state {N = 4096). 
(b) Average degree (fc) vs. Ti/to for N — 10'*, averaged over 100 
realizations. Inset: linear dependence between (k) and A (see text); 
the solid line indicates (fc) = A/2. In all cases vo = \/2 and v = 1. 
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TABLE I: Critical exponents related to the emergence of the giant 
cluster for the network of mobile agents, compared to the ones of 
mean-field and 2D percolation. 

In Fig.^ we show the degree per agent (fc) vs. Ti/tq. For 
each value of Ti/tq the average degree was averaged over 
different snapshots in the QS regime, yielding a non-linear 
function of Ti/tq, which depends on the chosen density. An 
approximate analytical treatment of this dependence can be 
made and will be presented elsewhere. Further, the average 
degree is a function of the average number A of collisions dur- 
ing the average residence time T/ — {A) , and is defined as 
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As illustrated in the inset of Fig.^, we find {k) 
line), independently of the density. 

In the presented model, we find a critical value Ac = 2.04, 
beyond which a giant cluster of connected nodes emerges. Ta- 
bleUshows the values obtained numerically with the standard 
method of finite size scaling for systems of iV = 2^°. ..2^^, the 
results are compared with exponents for mean field and two- 
dimensional percolation {2D). Since the agents move on a 
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FIG. 2: (Color online) (a) Average shortest path length I and cluster- 
ing coefficient C as functions of the average degree {k). Empirical 
data (symbols) compared to simulations (solid lines), (b) Plot of 
Ti I To as a function of (fc) for the agents models (solid line). Stars il- 
lustrate two particular schools for Figs.|3|and|5|having Tj/to = 4.75 
(school 1) and 6.0 (school 2) respectively, (c) Second moment (fc^) 
for each school vs. the second moment of the corresponding simula- 
tion with the agent model (solid line has slope one). 



ID plane and have only a finite life time, they can only estab- 
lish connections within a restricted vicinity. This effect corre- 
sponds to a connectivity which is short range at each snapshot 
of the system. So, although our clusters are not quenched in 
time the underlying problem corresponds to short range 2D 
percolation. We have also explicitly calculated the correlation 
length as the linear size of clusters, and confirm that near the 
critical point this quantity diverges with precisely the same 
exponent v obtained from the finite size scaling. 
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FIG. 3: (a) Degree distribution P{k) averaged over all the schools 
(symbols) compared to P{k) of the simulations (solid line). The 
inset shows the results for a particular school (school 1). (b) Average 
degree Knn of the nearest neighbors as a function of k. Dashed 
and dotted lines indicate the Poisson and exponential distributions 
respectively, for the same average degree (k). 

The degree distribution P(fc) is a direct consequence of the 
collision rule, i.e. it depends on v in Eq. (|2}. For v — 0, 
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the degree distribution is well fitted by a Poisson distribu- 
tion, Pp(fc) = {{k)'' /k\)exp{-{k)). The degree distribu- 
tion obtained for v = 1, resembles an exponential of the form 
Pe{k) = ((fc) - l)"iexp(-(fc- l)/((fc) - 1)). However, 
while for small (fc) the degree distribution of the giant cluster 
is exponential of the form of Pe(fc), for larger (fc) it deviates 
from this shape. The same deviation as (fc) increases is in fact 
found in empirical data, e.g. the friendship networks of the 84 
schools. For each of the schools. Fig. shows the average 
shortest path length / (circles), and the CC (triangles). SoUd 
lines indicate the results obtained for the agent model using 
the same range of values of (k), averaged over 100 realiza- 
tions with N = 2209 and p = 0.1. Since / depends on the 
network size, it is divided by the shortest path length of a 
random graph with the same average degree and size. Clearly, 
the agent model predicts accurately both the CC and the short- 
est path length for the same average degree. 
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FIG. 6: Cumulative degree distribution of the number k of sexual 
partners in a real empirical network of sexual contacts (triangles) 
with 250 individuals, compared with the simulation of the agent 
model (solid line), the dotted line is a guide to the eye with slope 
2. Here iV = 4096, Ti/to = 5.5 and (fc) = 7.32 and the average 
size of the resulting sexual network is 220. 
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FIG. 4: (Color online) (a) Example of trajectories of 4 agents (en- 
closed in a box and enlarged in the inset) and 10 agents (showed by 
arrows) forming a 3-clique sketched in (b) and (c). 
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FIG. 5: (a) Distribution of community size s of 3-clique communi- 
ties for one particular school (school 2) (b) the corresponding aver- 
age over the 84 schools of the data set. Empirical data (symbols) 
compared to simulations (solid lines with error bars). 



By computing the average degree (fc) of each school one is 
able to obtain the value of T/ / Tq for which the agent model re- 
produces properly the empirical data, as illustrated in Fig.|2j). 
Here solid lines indicate the prediction curve for the agent 
model, while triangles indicate the values of Ti / Tq chosen to 
reproduce the social network of the schools with the resulting 
value of (k). Moreover, the second moment {k'^)ag obtained 
with the simulations of the agent model is a rescaling of the 
same quantity {k"^) sch measured for the empirical school net- 
works, as shown in Fig.|2l;. 

Figure |3^ shows the degree distribution averaged over all 
the schools, compared with the average of the ones obtained 
from the agent model simulations using the chosen values of 
Ti according to the relation sketched in Fig.|2j3. As one clearly 
sees, the degree distribution obtained with the agent model fits 
much better the empirical data than the exponential (dotted 
line) or Poisson (dashed line) distributions for a given (fc). 
The inset in the figure|3^ shows the comparison of the network 
of one particular school (school 1 in Fig.|2j, and the average 
over 20 realizations of its corresponding model (with Ti/tq = 
4.75). 

Degree correlations can be quantified by computing 
Knn{k), the average degree of the nearest neighbors of a ver- 
tex of degree fc |3|. Figure|3j) shows a good agreement of this 
value between real data and model for the same networks of 
Fig- Eh- Similar to other social networks the mixing is assor- 
tative 01, i-S- Knn increases with k, but in contrast to net- 
works with scale free degree distribution (i.e. collaboration 
networks), i^„„(fc) for friendship networks present a cutoff 
due to the rapid decay in the degree distribution. 

Further, the typical community structure found in social 
networks, , is also reproduced with the agent model. Here, 
we use a precise definition of network community recently 
proposed 1 8] based on the concept of fc-clique community. In 
Fig. 13 we plot the system of mobile agents, drawing only the 
trajectories of the agents which belong to two 3-clique com- 
munities, having 4 and 10 agents and sketched in Fig.|^ and 
Fig. 1^ respectively. Agents that form a community share a 
region in space and agents with larger trajectories are respon- 
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sible for building up the community. It should be pointed out 
that the agent motion in the system has not the straightforward 
meaning of human motion in physical space, but may be better 
related with affinities among individuals. 

Figure|5^ shows the size distribution of 3-clique communi- 
ties in a particular school (school 2) compared with the sim- 
ulation for the suitable value of Ti/tq (see Fig.|3, while in 
Fig- Eh the average over all schools is compared with the av- 
erage over 10 realization of the corresponding model for each 
school. In both cases, the agent model reproduces the distribu- 
tion of community size observed for the empirical data, partic- 
ularly the feature related with the existence of a big commu- 
nity having a large fraction of the population, namely s ~ 10'^ 
agents. 

In the particular case of sexual contacts it has been reported 
that the degree distribution presents a power-law j9j|. Figure 
|6lshows with triangles the cumulative degree distribution of a 
sexual contact network extracted from a tracing study for HIV 
tests in Colorado Springs (USA) with 250 individuals fioll . 
The dashed line indicates the degree distribution of a social 
contact network simulated with the agent model while the 
solid line is the degree distribution of a subset of contacts from 
the social network. The contacts in the subset are chosen by 
assigning to each agent an intrinsic property which enables 
one to select from all the social contacts the ones which are 
sexual. Namely, when two agents form a link, as stated be- 
fore, this link is now marked as a 'sexual contact' if the sum 
of the property values of the two agents is greater than a given 
threshold. These property values are assigned to the agents 



with an exponential distribution and the conditional threshold 
is In N/ 2, following the scheme of intrinsic fitness proposed 
in another context by Caldarelli et. al. fTlll . Interestingly, one 
is able to extract from the typical distributions of social con- 
tacts shown throughout the paper, power-law distributions in 
QS which resemble much the ones observed in real networks 
of sexual contacts. 

In conclusion, we presented a novel approach to construct 
contact networks, based on a system of mobile agents. For a 
suitable collision rule and aging scheme we have shown that 
one is able to produce quasi-stationary states which repro- 
duce accurately the main statistical and topological features 
observed in recent empirical social networks. The QS state of 
the agent model is fully characterized by one single parame- 
ter and yields a phase transition belonging to the universality 
class of two-dimensional percolation. Moreover, we showed 
that, by introducing an additional property labeling the abil- 
ity to select a particular type of social contact, e.g. sexual 
contacts, the degree distributions reduce to power-law distri- 
butions as observed in real sexual networks. Summarizing, 
we gave evidence that motion of the nodes is a fundamental 
feature to reproduce social networks, and therefore the above 
model could be important to improve the study and may serve 
as a novel approach to model empirical contact networks. 
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